On the Evaluation of Rhythmic and Melodic Descriptors for Music Similarity
نویسندگان
چکیده
In exploratory studies of large music collections where often no ground truth is available, it is essential to evaluate the suitability of the underlying methods prior to drawing any conclusions. In this study we focus on the evaluation of audio features that can be used for rhythmic and melodic content description and similarity estimation. We select a set of state-of-the-art rhythmic and melodic descriptors and assess their invariance with respect to transformations of timbre, recording quality, tempo and pitch. We create a dataset of synthesised audio and investigate which features are invariant to the aforementioned transformations and whether invariance is affected by characteristics of the music style and the monophonic or polyphonic character of the audio recording. From the descriptors tested, the scale transform performed best for rhythm classification and retrieval and pitch bihistogram performed best for melody. The proposed evaluation strategy can inform decisions in the feature design process leading to significant improvement in the reliability of the features.
منابع مشابه
The musical language Elements of Persian musical language: modes, rhythm and syntax
In treating the subject of musical language, a Persian musician would be intrinsically drawn to the structural similarities between the Persian music and language. Indeed Persian music and language are extremely related in their metrics, intonations and structural phrases (syntax). Although we will draw upon this relationship, our aim in this article is to present “music as a language,” c...
متن کاملNovel Mid-Level Audio Features for Music Similarity
Large-scale systems for automatic content-based music recommendation require efficient computation of signal descriptors that are robust and relevant with regard to human perception in order to process extensive music archives. In this publication, a set of mid-level audio features suitable for efficient characterization of musical signals with regard to automatic music similarity estimation is...
متن کاملThe Flamenco Cante: Automatic Characterization of Flamenco Singing by Analyzing Audio Recordings
Flamenco singing is a highly expressive improvisational artform characterized by its deviation from the Western tonal system, freedom in rhythmic interpretation and a high amount of melodic ornamentation. Consequently, a singing performance represents a fusion of style-related constraints and the individual spontaneous interpretation. This study focuses on the description of the characteristics...
متن کاملHomayoun as a Persian Music Scale on Non-Musician’s Brain: an fMRI Study
Introduction: The aim of this study was to get to a neurological evaluation of one of the Persian music scales, Homayoun, on brain activation of non-musician subjects. We selected this scale because Homayoun is one of the main scales in Persian classical music which is similar to minor mode in western scales. Methods: This study was performed on 19 right handed subjects, Aging 22-31. Here some ...
متن کاملA Mid-level Melody-based Representation for Calculating Audio Similarity
We propose a mid-level melody-based representation that incorporates melodic, rhythmic and structural aspects of a music signal and is useful for calculating audio similarity measures. Most current approaches to music similarity use either low-level signal features, such as MFCCs that mostly capture timbral characteristics of music and contain little semantic information, or require symbolic re...
متن کامل